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8.  STATEMENT  OF  PROBLEM 


The  aim  of  this  project  was  to  study  the  structure,  synthesis,  intraprotein  and  protein-protein 
interactions  of  a  family  of  secreted  silk  proteins  that  are  synthesized  in  larval  salivary  glands  of 
the  midge,  Chironomus  tentans.  These  silk  proteins  are  mainly  composed  tandem  copies  of 
novel  repeated  amino  acid  sequences  and  assemble  in  vivo  into  complexes  that  form  insoluble 
silk  fibers.  Repeated  domains  from  silk  proteins  made  provide  unique  and  useful  protein-based 
polymers. 


9.  SUMMARY  OF  IMPORTANT  RESULTS 

(Superscripts  refer  to  PUBLICATIONS  listed  in  section  10.) 


A.  Chironomus  Silk  is  Unique 

Why  study  the  silk  of  a  lesser  known  Dipteran,  like  Chironomus,  when  silk  is  spun  by 
more  well  known  arachnids  and  several  insects  orders?  First,  while  most  known  silks  have 
similar  amino  acid  compositions,  Chironomus  silk  is  the  only  one  with  significant  amounts  of 
cysteine  (Cys)5.  Second,  the  molecular  biological  data  base  for  Chironomus  silk  proteins 
(including  but  not  limited  to  the  chromosomal  location,  organization  and  developmentally 
regulated  expression  of  thirteen  silk  protein  genes  and  their  encoded  primary  amino  acid 
sequences)  exceeds  that  for  any  other  organism1.  Third,  intermediate  and  large  size  classes  of 
silk  proteins  are  composed  of  50-130  tandem  copies  of  novel  amino  acid  sequences  punctuated 
by  conserved  Cys-containing  motifs1.  The  uniquness  of  these  proteins  provides  an  opportunity 
to  study  a  natural  biopolymer  with  potential  application  of  biomolecular  materials. 

B.  Silk  Protein  Interactions  in  Vivo 

Insoluble  Chironomus  silk  is  spun  from  a  reservoir  of  soluble  silk  proteins  in  the  salivary 
gland  lumen.  Optical  characterization  of  lumenal  proteins  indicates  their  ability  to  form  nematic 
liquid  crystals  that  likely  assemble  from  supermolecular  aggregates4,7.  Presumably,  these 
"aggregates"  form  the  networks  of  multi-stranded  beaded  fibers  which  we  previously 
demonstrated  can  be  disassembled  and  reassembled  in  vitro.  Development  of  a  suitable  system 
for  non-denaturing  agarose  gel  electrophoresis9  provided  initial  evidence  for  the  existence  of  such 
"aggregates"  which  were  subsequently  identified12  as  discrete  silk  protein  complexes:  complex 
Cla  nominally  contains  spla  +  spl85  +  spl40;  Clb  contains  splb  +  spl85.  These  complexes 
are  primarily  stabilized  by  non-covalent  interactions,  though  a  small  percentage  of  molecules 
exhibit  intermolecular  disulfide  bonds.  Other  proteins  (sp40,  spl7  and  spl2)  co-sediment  with 
Cla  and  Clb  and  may  also  be  part  of  these  complexes  which,  in  turn,  aggregate  into  multimeric 
oligomers.  Isolated  complexes  assemble  in  vitro  into  multi-stranded  beaded  fibers12,  the 
precursors  to  silk  threads. 

C.  Synthetic  and  Recombinant  Silk  Protein  Interactions  in  Vitro 
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The  paucity  and  heterogeneity  of  natural  silk  proteins  renders  them  unsuitable  reagents 
for  in-depth  structural  studies.  To  overcome  this  limitation,  we  synthesized  peptides  and 
recombinant  proteins  as  model  substrates  for  detailed  biophysical  and  biochemical  studies.  Since 
spls  form  the  fibrous  backbone  of  silk,  we  chose  to  study  their  core  repeats.  The  "constant 
regions"  of  these  repeats  has  four  Cys  residues  that  in  synthetic  peptides6  and  recombinant 
proteins3,8,10  reversibly  form  two  disulfide  bonds.  The  major  focus  of  this  work  was  rCAS,  a 
recombinant  Constant  And  Subrepeat  (rCAS)  protein  modeled  after  a  core  repeat  from  spla3. 

Characterization  of  rCAS  included  polyacrylamide  gel  electrophoresis,  N-terminal  amino 
acid  sequencing,  quantitative  amino  acid  composition,  quantitative  sulfhydryl  assays,  matrix- 
assisted  laser  desorption/time-of-flight  and  electrospray  ionization  mass  spectrometry  and 
mapping  disulfide-containing  tryptic  peptides10.  As  purified,  there  are  three  rCAS  conformers 
that  differ  in  their  Cys-pairing.  Denatured  and  reduced  rCAS  was  subjected  to  refolding  at  pH 
8  and  10  with  02-  and  DTT°x-mediated  formation  of  disulfide  bonds.  The  kinetics  of  refolding 
was  monitored  at  different  times  by  HPLC.  While  the  rates  of  the  reactions  varied,  in  all 
instances,  reduced  rCAS  passed  through  one  disulfide-containing  intermediates  on  the  way  to 
reforming  two  disulfide  bonds;  however,  heterogeneity  of  Cys  pairs  persisted  suggesting  that 
rCAS  may  have  more  than  one  stable  structure.  The  occasional  appearance  of  rCAS  dimers  and 
trimers  prompted  us  to  investigate  aggregation  in  solution  by  analytical  equilibrium 
ultracentrifugation.  At  protein  concentrations  ranging  from  0.25-1.0  mg/ml  in  100  mM  Na+, 
rCAS  is  a  monomer10. 

Using  in  vitro  mutagenesis,  a  set  of  20  genes  was  made  that  encompass  all  single,  double, 
triple  and  quadruple  Cys-»Ala  substitutions.  Most  also  include  a  Phe-*Trp  substitution  that 
provides  an  intrinsic  fluorochrome  at  one  specified  site.  We  subcloned  19/20  mutant  genes  into 
expression  vector  pET3a  and  obtained  inducible  synthesis  of  all  recombinant  proteins.  Selected 
variants  are  being  purified  and  subjected  to  biochemical  verification  of  their  encoded  amino  acid 
substitutions  by  amino  acid  composition,  sulfhydryl  counting  gels,  mass  spectrometry,  and  UV 
absorption  and  fluorescence  spectroscopies.  Preliminary  data  are  exciting.  For  example,  one 
Cys-*Ala  substitution  alters  reformation  of  disulfide  bonds8  and  the  quantum  yield  of 
fluorescence  for  QUAD  (Cys9)2i>23t79-^Ala)  is  similar  to  free  Trp  whereas  F16W  exhibits  a 
decreased  yield  that  can  be  increased  by  denaturation.  This  suggests  that  while  the  sole  Trp  in 
F16W  is  buried,  in  QUAD,  which  lacks  Cys  (hence  no  disulfide  bonds),  it  is  exposed  to  the 
solvent.  These  results  have  set  the  stage  for  a  detailed  structural  and  thermodynamic  comparison 
of  these  proteins. 

D.  Related  Proteins 

Several  collaborations  have  enabled  us  to  examine  other  Cys-containing  proteins  and  gain 
experience  with  biophysical  methods.  For  example,  the  homologue  of  C.  tentans  spl85  (motif 
Cys-X-Cys-X-Cys  every  22-26  residues)  has  been  identified  in  two  other  species  of 
Chironomus2.  Cys-rich  chorion  (egg-shell)  shell  proteins  in  two  Lepidopteran  have  been 
examined11.  Native,  synthetic  and  recombinant  proteins  have  been  examined  by  a  variety  of 
spectrosocpic  methods.  Most  notably  we  have  acquired  the  capacity  to  conduct  FUR  of  proteins 
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in  solution;  this  offers  advantages  over  the  typical  acquisition  of  spectra  in  KBr  pellets.  A 
spectral  library  of  known  proteins  has  been  assembled  to  provide  basis  spectra  from  which 
quantitative  estimates  of  the  percentages  of  secondary  structures  can  be  made  for  unknown 
proteins. 
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